66 research outputs found

Overview of data preprocessing for machine learning applications in human microbiome research

Author: Andrea Simeon
Blaž Stres
Domenica D’Elia
Eliana Ibrahimi
Karel Hron
Laura Judith Marcos-Zambrano
Magali Berland
Marta B. Lopes
Rajesh Shigdel
Xhilda Dhamo
Publication venue: Frontiers Media S.A.
Publication date: 01/10/2023

Although metagenomic sequencing is now the preferred technique to study microbiome-host interactions, analyzing and interpreting microbiome sequencing data presents challenges primarily attributed to the statistical specificities of the data (e.g., sparse, over-dispersed, compositional, inter-variable dependency). This mini review explores preprocessing and transformation methods applied in recent human microbiome studies to address microbiome data analysis challenges. Our results indicate a limited adoption of transformation methods targeting the statistical characteristics of microbiome sequencing data. Instead, there is a prevalent usage of relative and normalization-based transformations that do not account for the specific attributes of microbiome data. The information on preprocessing and transformations applied to the data before analysis was incomplete or missing in many publications, leading to reproducibility concerns, comparability issues, and questionable results. We hope this mini review will provide researchers and newcomers to the field of human microbiome research with an up-to-date point of reference for various data transformation tools and assist them in choosing the most suitable transformation method based on their research questions, objectives, and data characteristics.
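
The contrast this abstract draws, between relative transformations and transformations that target compositional data, can be illustrated with a minimal sketch. The abstract does not name a specific method; the centered log-ratio (CLR) is used here only as one widely known compositional transformation. The function names, the pseudocount value, and the sample counts are all hypothetical choices made for illustration.

```python
import math

def relative_abundance(counts):
    """Total-sum scaling: divide each count by the sample total."""
    total = sum(counts)
    if total == 0:
        raise ValueError("sample has no reads")
    return [c / total for c in counts]

def clr(counts, pseudocount=0.5):
    """Centered log-ratio: log of each value minus the mean log value.

    The pseudocount (an assumed value, not a recommendation) replaces
    zeros so that the logarithm is defined for sparse samples.
    """
    logs = [math.log(c + pseudocount) for c in counts]
    mean_log = sum(logs) / len(logs)
    return [v - mean_log for v in logs]

# Hypothetical read counts for four taxa in one sample (note the zero).
sample = [120, 0, 30, 850]

rel = relative_abundance(sample)  # proportions that sum to 1
clr_values = clr(sample)          # log-ratios that sum to 0
```

Relative abundances remain constrained to sum to one, whereas CLR values are centered on zero, which is one reason log-ratio methods are often discussed for compositional data. How zeros are handled (here, a fixed pseudocount) is itself an analysis choice that should be reported.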

Directory of Open Access Journals

Applications of Machine Learning in Human Microbiome Studies: A Review on Feature Selection, Biomarker Identification, Disease Prediction and Treatment

Author: Aasmets O
Berland M
Carrillo de Santa Pau E
Claesson MJ
Gruca A
Hasic J
Hron K
Karaduzovic-Hadziabdic K
Klammsteiner T
Kolev M
Lahti L
Loncar-Turukalo T
Lopes MB
Marcos-Zambrano LJ
Moreno V
Moreno-Indias I
Naskinova I
Org E
Paciência I
Papoutsoglou G
Przymus P
Shigdel R
Stres B
Trajkovik V
Truu J
Tsamardinos I
Vilne B
Yousef M
Zdravevski E
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2021

The number of microbiome-related studies has notably increased the availability of data on human microbiome composition and function. These studies provide the essential material to deeply explore host-microbiome associations and their relation to the development and progression of various complex diseases. Improved data-analytical tools are needed to exploit all information from these biological datasets, taking into account the peculiarities of microbiome data, i.e., compositional, heterogeneous and sparse nature of these datasets. The possibility of predicting host-phenotypes based on taxonomy-informed feature selection to establish an association between microbiome and predict disease states is beneficial for personalized medicine. In this regard, machine learning (ML) provides new insights into the development of models that can be used to predict outputs, such as classification and prediction in microbiology, infer host phenotypes to predict diseases and use microbial communities to stratify patients by their characterization of state-specific microbial signatures. Here we review the state-of-the-art ML methods and respective software applied in human microbiome studies, performed as part of the COST Action ML4Microbiome activities. This scoping review focuses on the application of ML in microbiome studies related to association and clinical use for diagnostics, prognostics, and therapeutics. Although the data presented here is more related to the bacterial community, many algorithms could be applied in general, regardless of the feature type. This literature and software review covering this broad topic is aligned with the scoping review methodology. 
The manual identification of data sources has been complemented with: (1) automated publication search through digital libraries of the three major publishers using a natural language processing (NLP) toolkit, and (2) an automated identification of relevant software repositories on GitHub and ranking of the related research papers relying on a learning-to-rank approach. This study was supported by COST Action CA18131 “Statistical and machine learning techniques in human microbiome studies”. Estonian Research Council grant PRG548 (JT). Spanish State Research Agency Juan de la Cierva Grant IJC2019-042188-I (LM-Z). EO was funded and OA was supported by Estonian Research Council grant PUT 1371 and EMBO Installation grant 3573. AG was supported by Statutory Research project of the Department of Computer Networks and Systems.

Repositório Aberto da Universidade do Porto

Community effort endorsing multiscale modelling, multiscale data science and multiscale computing for systems medicine

Author: Basilio J
Baumbach J
Castiglione F
Chorbev I
Debeljak N
Groen D
Klimek P
Rozman D
Schmid JA
Schmidt HHHW
Stalidzans E
Stres B
Tieri P
Vera J
Zanin M
Zheng H
Publication venue: 'Oxford University Press (OUP)'
Publication date: 05/12/2017

© The Author 2017. Published by Oxford University Press. Systems medicine holds many promises, but has so far provided only a limited number of proofs of principle. To address this road block, possible barriers and challenges of translating systems medicine into clinical practice need to be identified and addressed. The members of the European Cooperation in Science and Technology (COST) Action CA15120 Open Multiscale Systems Medicine (OpenMultiMed) wish to engage the scientific community of systems medicine and multiscale modelling, data science and computing, to provide their feedback in a structured manner. This will result in follow-up white papers and open access resources to accelerate the clinical translation of systems medicine. Austrian Science Fund: Special Research Program SFB-F54. The European Cooperation in Science and Technology (COST) Action CA15120 OpenMultiMed (http://openmultimed.net).

Brunel University Research Archive

Applications of Machine Learning in Human Microbiome Studies: A Review on Feature Selection, Biomarker Identification, Disease Prediction and Treatment

University of Bergen

NORA - Norwegian Open Research Archives

Diposit Digital de la Universitat de Barcelona

Fondo Bibliográfico Digital Institucional

Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions

CA18131 CP16/00163 NIS-3317 NIS-3318 decision 295741 C18/BM/12585940

The human microbiome has emerged as a central research topic in human biology and biomedicine. Current microbiome studies generate high-throughput omics data across different body sites, populations, and life stages. Many of the challenges in microbiome research are similar to other high-throughput studies; the quantitative analyses need to address the heterogeneity of data, specific statistical properties, and the remarkable variation in microbiome composition across individuals and body sites. This has led to a broad spectrum of statistical and machine learning challenges that range from study design, data processing, and standardization to analysis, modeling, cross-study comparison, prediction, data science ecosystems, and reproducible reporting. Nevertheless, although many statistics and machine learning approaches and tools have been developed, new techniques are needed to deal with emerging applications and the vast heterogeneity of microbiome data. We review and discuss emerging applications of statistical and machine learning techniques in human microbiome studies and introduce the COST Action CA18131 “ML4Microbiome” that brings together microbiome researchers and machine learning experts to address current challenges such as standardization of analysis pipelines for reproducibility of data analysis results, benchmarking, improvement, or development of existing and new tools and ontologies.

University of Bergen

Repositório da Universidade Nova de Lisboa

EUR Research Repository

Cork Open Research Archive

NORA - Norwegian Open Research Archives

Open Repository and Bibliography - Luxembourg

Utrecht University Repository

Erciyes University - AVESIS

Riga Stradins university

Fondo Bibliográfico Digital Institucional

Phylogenetic congruence and ecological coherence in terrestrial Thaumarchaeota

Author: A Daebeler
A Gobet
A Lopez-Lopez
A Sayavedra-Soto
A Spang
AF Koeppel
AJ Drummond
B Stempfhuber
B Stres
C Brochier-Armanet
C Gubry-Rangin
Christopher Quince
Cécile Gubry-Rangin
DA Stahl
DL Sun
DP Martin
E Paradis
E Pereira
Eduard Vico Oton
EF DeLong
Graeme W Nicol
GW Nicol
H Cao
H Urakawa
I Richter
J Mertens
J Oksanen
JA Balbuena
James I Prosser
JH Gunderson
JI Prosser
JW Leigh
K Hommola
K Zhalnina
L Lu
L Philippot
LE Lehtovirta
LE Lehtovirta-Morley
LJ Revell
M Groussin
M Könneke
M Pester
M Tourna
N Stopnisek
NJ Matzke
P Legendre
P Offre
PJ McMurdie
R Angel
R Grosskopf
RC Edgar
RI Griffiths
RJ Alves
RJ Case
S Leininger
SA Placella
SG Acinas
SJ Biller
SN Wood
ST Bates
T Coenye
T Ochsenreiter
U Szukics
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/07/2015

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. Acknowledgements: We would like to thank Dr Robert Griffith/CEH for providing DNA from soil samples and Dr Anthony Travis for his help with BioLinux. Sequencing was performed in NERC platform in Liverpool. CG-R was funded by a NERC fellowship NE/J019151/1. CQ was funded by a MRC fellowship (MR/M50161X/1) as part of the cloud infrastructure for microbial genomics consortium (MR/L015080/1). Peer reviewed.

Aberdeen University Research

Crossref

HAL Descartes

PubMed Central

Warwick Research Archives Portal Repository

University of East Anglia digital repository

Applications of Machine Learning in Human Microbiome Studies: A Review on Feature Selection, Biomarker Identification, Disease Prediction and Treatment

Author: Aasmets Oliver
Berland Magali
Carrillo de Santa Pau Enrique
Claesson Marcus J.
Gruca Aleksandra
Hasic Jasminka
Hron Karel
Karaduzovic-Hadziabdic Kanita
Klammsteiner Thomas
Kolev Mikhail
Lahti Leo
Loncar-Turukalo Tatjana
Lopes Marta B.
Marcos-Zambrano Laura Judith
Moreno Victor
Moreno-Indias Isabel
Naskinova Irina
Org Elin
Paciência Inês
Papoutsoglou Georgios
Przymus Piotr
Shigdel Rajesh
Stres Blaz
Trajkovik Vladimir
Truu Jaak
Tsamardinos Ioannis
Vilne Baiba
Yousef Malik
Zdravevski Eftim
Publication venue: 'Frontiers Media SA'
Publication date: 28/10/2022

UTUPub

Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions

The human microbiome has emerged as a central research topic in human biology and biomedicine. Current microbiome studies generate high-throughput omics data across different body sites, populations, and life stages. Many of the challenges in microbiome research are similar to other high-throughput studies; the quantitative analyses need to address the heterogeneity of data, specific statistical properties, and the remarkable variation in microbiome composition across individuals and body sites. This has led to a broad spectrum of statistical and machine learning challenges that range from study design, data processing, and standardization to analysis, modeling, cross-study comparison, prediction, data science ecosystems, and reproducible reporting. Nevertheless, although many statistics and machine learning approaches and tools have been developed, new techniques are needed to deal with emerging applications and the vast heterogeneity of microbiome data. We review and discuss emerging applications of statistical and machine learning techniques in human microbiome studies and introduce the COST Action CA18131 "ML4Microbiome" that brings together microbiome researchers and machine learning experts to address current challenges such as standardization of analysis pipelines for reproducibility of data analysis results, benchmarking, improvement, or development of existing and new tools and ontologies.

UTUPub

Comparison of soil bacterial communities in a natural hardwood forest and coniferous plantations in perhumid subtropical low mountains

Author: A Agresti
A Chatterjee
A Sowerby
B Guenet
B Stres
BFT Brockett
BM Tripathi
C Lozupone
CL Lauber
D Liu
DA Lipson
DJ Lane
DR Nemergut
E Gömöryová
H Meng
İ Bolat
J Zimmermann
JF Araujo
K Jangid
KE Ashelford
KR Clarke
L Sauheitl
L Zhang
M Högberg
M Sait
M Ushio
ME Lucas-Borja
NJA Curlevski
NL Ward
PD Schloss
PH Janssen
Q Wang
RJ Mitchell
SA Yarwood
UN Nielsen
YM Oh
YT Lin
ZH Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date

Crossref

Freeze–thaw cycles have minimal effect on the mineralisation of low molecular weight, dissolved organic carbon in Arctic soils

Author: A Herrmann
A Malik
A. Foster
AC Elliott
B Stres
BP Degens
C Apostel
CR Morley
D. L. Jones
DA Lipson
DL Jones
E Boddy
E Matzner
E Morgner
E. J. Cooper
ED Vance
EJ Førland
G Guggenberger
G Hugelius
GL Tierney
H Glanville
HAL Henry
HT Koponen
I Kögel-Knabner
J Farrar
J Rousk
JP Schimel
JW Payne
K Fujii
K Hentschel
K Kalbitz
KM Miranda
KS Larsen
LS Vestgarden
M Alexander
M Bird
M Farrell
M Freppaz
M Haei
M Özgül
MK Männistö
MS Strickland
NH Batjes
OA Anisimov
P Grogan
P Roberts
P. Roberts
PAW Hees Van
PR Semenchuk
PW Hill
RL Mulvaney
SA Oswald
SD Goldberg
SH Drotz
SK McMahon
SL Wilson
T Skogland
TG Cartledge
WD Billings
X Feng
X Yu
Publication venue: 'Springer Science and Business Media LLC'
Publication date

Crossref